A Practical Partial Parser for Biomedical Literature Summarization

نویسندگان

  • Yasunori Yamamoto
  • Toshihisa Takagi
چکیده

We present a partial parser called TeLePaPa (TextLens Partial Parser) to identify subjects and predicate verbs (SPVs) in a sentence of abstracts of MEDLINE citations. The performance of TeLePaPa is the precision of 96.7% and 97.1% for the SPV detection, respectively, and the recall of 91.3% and 94.9%, respectively. We found that there was a similarity in the distribution of the pairs of SPV over different research topics in the domain. In addition, we found that the power law holds for the relationship of the number of citations uncovered by SPV pairs and its rank. That is, only a half of the pairs covered about 90% of all the citations. This fact enables us to efficiently scan the huge amount of biomedical literature.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

APOLN: A Partial Parser Of Unrestricted Text

In this paper, we present APOLN (Analizador Parcial de Oraciones en Lenguaje Natural): a partial parser of unrestricted natural language sentences based on finite-state techniques. Partial parsing has been used in several applications: syntactic parsing of unrestricted texts, data extraction systems, machine translation, solving the attachment ambiguity, speech recognition systems, text summari...

متن کامل

Systematic literature review of fuzzy logic based text summarization

Information Overloadrq  is not a new term but with the massive development in technology which enables anytime, anywhere, easy and unlimited access; participation & publishing of information has consequently escalated its impact. Assisting userslq    informational searches with reduced reading surfing time by extracting and evaluating accurate, authentic & relevant information are the primary c...

متن کامل

Abstraction Summarization For Managing The Biomedical Research Literature

ion Summarization for Managing the Biomedical Research Literature Marcelo Fiszman Thomas C. Rindflesch Halil Kilicoglu Lister Hill National Center for Biomedical Communications National Library of Medicine Bethesda, MD 20894 {fiszman|tcr|halil}@nlm.nih.gov

متن کامل

Resolving ambiguity in biomedical text to improve summarization

Access to the vast body of research literature that is now available on biomedicine and related fields can be improved with automatic summarization. This paper describes a summarization system for the biomedical domain that represents documents as graphs formed from concepts and relations in the UMLS Metathesaurus. This system has to deal with the ambiguities that occur in biomedical documents....

متن کامل

Citation Handling: Processing Citation Texts in Scientific Documents

Title of thesis: CITATION HANDLING: PROCESSING CITATION TEXTS IN SCIENTIFIC DOCUMENTS Michael Alan Whidby Master of Science, 2012 Thesis directed by: Professor Bonnie Dorr Dr. David Zajic Department of Computer Science Citation sentences (sentences that cite other papers) play a key role in the summarization of scientific articles. However, a citation-based summarization system that depends on ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004